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Abstract 

Intuitively speaking, a classical field theory is background-independent 
if the structure required to make sense of its equations is itself subject 
to dynamical evolution, rather than being imposed ab initio. The 
aim of this paper is to provide an explication of this intuitive notion. 
Background-independence is not a not formal property of theories: the 
question whether a theory is background-independent depends upon 
how the theory is interpreted. Under the approach proposed here, a 
theory is fully background-independent relative to an interpretation if 
each physical possibility corresponds to a distinct spacetime geometry; 
and it falls short of full background-independence to the extent that 
this condition fails. 



* Forthcoming in General Relativity and Gravitation. 



1 



1 Introduction 



In in pre-relativistic physics, as in common sense, space and time provide a 
fixed scaffolding or stage against which physical systems evolve. 

Ask an intelligent man who is not a scholar what space and time 
are, and he will perhaps answer as follows. If we imagine all 
physical things, all stars, all light taken out of the universe, what 
then remains is something like a giant vessel without walls called 
'space.' With respect to what is happening in the world, it plays 
the same role as the stage in a theater performance. In this 
space, in this vessel without walls, there is an eternally uniformly 
occurring tick-tock . . .that is 'time.' Most natural scientists, up 
to the present, had this conception about the essence of space 
and time .... (Einstein J26J item 44a]) 

It is often suggested that what makes general relativity (and theories of its 
ilk) special is their background-independence — their treatment of space and 
time as among the actors in the theory rather than as a fixed stage upon 
which the action unfolds. 

'Background-independence' has no precise and fixed meaning. But ac- 
cording to the sense in which the term is used here, general relativity (in 
its usual formulation and under its usual interpretation) is a paragon of 
background-independence, while the theory of a Klein-Gordon field in Minkowksi 
spacetime is a paragon of background-dependence (whether or not the space- 
time metric is treated dynamical variable) Q 

Intuitively, a background-independent theory must be generally covariant— 
any structure that breaks general covariance is background structure. And if 
a theory is generally covariant without being background- independent, then 
its general covariance must in some sense be artificial or dispensable — one 

1 In fact, of course, the term 'background-independence' is used in many ways. The 
use in the present discussion is very close to that found in, e.g., [19] and [37]. Sometimes, 
however, 'background-independence' is used as a synonym for 'generally covariant' |42j— 
under this usage, a theory of a Klein-Gordon field propagating against a Minkowski metric 
counts as background-independent if the spacetime metric is treated as a variable in the 
equations of motion. Again, 'background-independence' is sometimes used as a name 
for a sweeping form of relationalism [41] — and then even general relativity is not fully 
background-independent, because the topological structure of spacetime and the signature 
of the metric are fixed ab initio. For another approach, see |22] . 
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expects general covariance to be broken in any formulation in which the back- 
ground is made explicit. So, speaking very roughly and intuitively, a theory 
is background-independent if and only if its most perspicuous formulation is 
generally covariant (but see the discussion of the Einstein-Maxwell theory in 
Section 17.41 below) . 

The question whether future theories of physics can be expected to be 
background-independent and the question whether various proposed approaches 
to quantum gravity live up to this expectation have been very widely dis- 
cussed in recent years. The goal of the present discussion is to offer an 
explication of the intuitive notion of background-independence as it applies 
to classical field theories with the hope that clear standards of background- 
independence will provide a framework to structure such debates. 

It might be thought that the notion of an absolute object [T| |2| [T4"l [T5 | [16] 
should play a crucial role here. For an absolute object of a theory is a 
field that is in a certain sense the same in each solution — and so it might 
naturally be suggested that a theory is background-independent if and only 
if it features no absolute objects. However, this suggestion runs into serious 
difficulties (see Remark 13.11 below). A different one is offered in its place — 
roughly speaking, that a theory is fully background-independent if alteration 
of the physical degrees of freedom always implies an alteration in geometry; 
and that a theory falls short of full background-independence to the extent 
that this condition fails. 

The analysis is intended to apply only to theories of the universe as a 
whole in which all interactions between systems are taken into account. The 
clearest cases of background-dependent theories are theories featuring fixed 
(i.e., solution-independent) fields. Such fields can play a number of roles 
in a theory — but in practice, they are typically used to represent geometry, 
fixed sources, or external fields. Plausibly, in a theory in which background 
structure is present without being encoded in fixed fields, the background 
still plays one of these same roles. By restricting attention to theories of 
the universe as a whole in which all interactions are taken into account, the 
possible roles of background are narrowed down to a single one — geometry. 

In Section [5] below, a framework for talking about field theories and 
their symmetries is sketched. Section [3] includes a preliminary discussion of 
background-independence illustrated by several examples. Sections H] and [5] 
develop the notions of geometrical degrees of freedom and of physical degrees 
of freedom that are required for the present approach. Various gradations of 
background- (in) dependence are defined in Section O Features and limits of 
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these notions are discussed in Section [7J 

Conventions. All Lie groups are taken to be connected. Where g denotes 
a metric, Ein[g], Weylfg], Riem[g], Riccifg], and R[g] denote, respectively, 
the corresponding Einstein tensor, Weyl tensor, Riemann tensor, Ricci ten- 
sor, and scalar curvature. 

2 Field Theories 

For present purposes, we can take a classical field theory to consist of the 
following elements. 

(a) A connected n-dimensional manifold V, the spacetime of the theory. 

(b) A set of tensors on V. These are the fixed fields of the theory. G is 
often the empty set in cases of interest. 

(c) A set {0i, . . . , 4>k} of dynamical fields on spacetime. A field is specified 
by specifying a type of tensor on V and a particular configuration of the 
field by specifying a tensor of that type. A configuration of the complete 
set of dynamical fields is denoted $ = (0i, . . . , 0^). 

(d) A space /C of field configurations, thought of as consisting of the kine- 
matically possible $. In typical cases, K. is determined by specifying the 
smoothness, asymptotic behaviour, etc. of the fields of the theory. 

(e) A set of differential equations A($; 0) that determines the space S C /C 
of solutions of the theory. The fixed fields play the role of parameters 
rather than variables in A. Any derivative operators appearing in A must 
be definable in terms of the fixed and dynamical fields of the theory. 

The symmetry group, Q, of a classical field theory is the group consisting 
of diffeomorphisms from /C to itself that map solutions to solutions and that 
are suitably local on vH We call the elements of Q the symmetries of the 

2 Here is one way to make this notion precise [46]. If G is a one-parameter group of 
diffeomorphisms from /C to itself, then it makes sense to speak of the infinitesimal generator 
£ of G. £ will be a vector field on /C. In the present setting, a vector (5$ € T$/C, <t> e /C, 
can be identified locally with a tensor field on V. We call G local in V if there is some k 
such that for any x € V the value of £($) € X$/C when evaluated at x depends only on 
the value of $ and its first k derivatives at x. We call G a one-parameter symmetry group 
of the field theory if it is local and maps solutions to solutions. We take Q to be the group 
generated by all of the one-parameter symmetry groups of the theory. 
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theory and write 7 • $ for the result of acting on $ G S by a symmetry 

ie g. 

We will be especially interested in symmetries corresponding to diffeo- 
morphisms of V. We will denote by Diff(V) the group of diffeomorphisms 
V and by Diff c (V) the group of compactly supported diffeomorphisms of V 
(so d G Diff(V) is in Diff c (V) if and only if there is a compact set K C V 
such that d is the identity on V/K). Since we are working with tensor fields, 
each field has a well-defined transformation law h-» d ■ <f) under the action 
of spacetime diffeomorphisms. Further, the action of diffeomorphisms on the 
space of fields is local in V. So the following is a special case of our general 
notion of symmetry. 

Definition 2.1 (Spatiotemporal Symmetry). A diffeomorphism d : V — > V 
is a spatiotemporal symmetry of a theory if $ G S implies d • $ G S. 

Definition 2.2 (Local General Covariance). We call a field theory locally 
generally covariant if every d G Diff c (l / ) is a spatiotemporal symmetry of 
the theory. 

Definition 2.3 (General Covariance). We call a field theory generally co- 
variant if every d G Diff(l / ) is a spatiotemporal symmetry of the theory. 

3 Examples 

In this section, a number of examples are presented as motivation for the 
following claims. 

(1) Background-independence admits of degrees, with theories falling in be- 
tween full background-independence and full background-dependence. 

(2) A degree of background-dependence can result from the imposition of 
asymptotic boundary conditions. 

(3) The notion of background-independence is not purely formal: whether a 
theory is background-independent depends on how it is interpreted. 

3.1 More or Less Background-Dependent Theories 

In paradigm background-dependent theories, solution-independent background 
structure is encoded in fixed fields. 
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Example 3.1. Consider the theory of a scalar field propagating on Minkowski 
spacetime in which the spacetime metric r\ is treated as a fixed field: space- 
time is V — M. n (n > 2); the only fixed field is 77; the only dynamical field is 
a scalar field 0; the space JC of kinematic possibilities is the space of twice- 
differentiable 0; and the equation of motion is the Klein-Gordon equation, 
= 0. Here we have full background-dependence: the Minkowski metric 
7] is provides the fixed stage against which the scalar field evolves. 

Of course one can have background-dependence even in the absence of 
fixed fields. 

Example 3.2. Consider the generally covariant analogue of the theory of 
the preceding example: Let V = M. n (n > 2); let there be no fixed field; let 
there be two dynamic fields, a scalar field <ft and a Lorentz-signature metric 
g; the space K. of kinematic possibilities is the space of twice-differentiable 
4> and g; and the equations of motion are Riem[g] = and D 9 = 0. Each 
solution (g, 0) just represents a Klein-Gordon field on a copy of Minkowski 
spacetime: the physics is exactly the same as in the preceding example. But 
whereas in the preceding example the group of spatiotemporal symmetries 
was given by the Poincare group, here it is the group DifflV) of spacetime 
diffeomorphisms. Despite its general covariance, this theory is background- 
dependent: although the metric g is a dynamical field, the spacetime geom- 
etry is solution-independent and plays the role of stage against which the 
scalar field evolves. 

It possible to cook up variants on this example that, intuitively, fall just 
short of full background-dependence. 

Example 3.3. Let everything be as in the preceding example, except that 
V is taken to have the topology of R x T m , where T m is the m-dimensional 
torus and it is built into the definition of K, that g is globally hyperbolic. 

Once again, the metric g is flat in any solution. But the theory has 
nontrivial global geometric degrees of freedom — the space of metrics modulo 
diffeomorphisms is finite-dimensionalJl 

This theory is of course about as far from being background-independent 
as is possible — at the local level, the spacetime geometry is Minkowskian in 
every solution, and it is this geometry that determines the behaviour of the 

3 For relevant degrees of freedom in the case of three spacetime dimensions, see, e.g., 
[3 [29]. For their operational significance, see [31]. 
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scalar field. But because the global geometry varies from solution to solution, 
it is natural to see the theory as falling (just) short of the full background 
dependence of the previous example. 

The next example is a variation on this theme which raises a point that 
will play a role the discussion below. 

Example 3.4. Let V ~ R x S 3 (S 3 being the three-sphere). We again have 
no fixed fields and two dynamic fields, a scalar field <ft an d a metric g. The 
space of kinematic possibilities is determined by the usual differentiability 
conditions together with the condition that g be geodesically complete. The 
equations of motion are 



This theory describes a scalar field propagating against a de Sitter spacetime 
of constant curvature K 2 : if (g, <f>) is a solution, then (V,g) is isometric to a 
timelike hyperboloid of radius K in Minkowski spacetimejj 

The equations of motion almost succeed in fixing the spacetime geometry 
up to diffeomorphism — in this theory, identifying metrics related by a diffeo- 
morphism leaves us with a single geometric degree of freedom, parameterized 
by K 2 . If we regard scale transformations as well as diffeomorphisms as relat- 
ing physically equivalent solutions, then we should regard this theory as fully 
background-dependent — for then there are no geometrical degrees of freedom 
on the theory. However, if we regard scale transformations as physical, then 
we should regard this theory as nearly but not entirely background-dependent 
in virtue of possessing a single geometrical degree of freedom. 

3.2 More or Less Background-Independent Theories 

At the other end of the spectrum from the examples we have been considering 
lies spatially compact vacuum general relativity, a paragon of background- 
independence. 

4 The equations of motion force (V, g) to be a space of constant curvature K 2 [23l p. 
124]. The assumption that g is geodesically complete then forces (V,g) to be a quotient 
of anti-de Sitter spacetime, Minkowski spacetime, or de Sitter spacetime [45l Theorem 
2.4.9]. But the topology of V rules out the first two options and forces (V,g) to be de 
Sitter spacetime itself. 



Weyl[g] 
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Example 3.5. Let V be an n-manifold of the form f xS where E is compact 
and let the only field be a twice-differentiable globally hyperbolic Lorentzian 
metric g subject to the vacuum Einstein equation, Ricci[g] = 0. This theory 
is generally covariant. It is also background- independent: the geometry of 
spacetime is determined dynamically rather than being imposed ab initio^ 

Just as there are theories that fall just short of background-dependence, 
there are also theories that fall just short of background- independence. 

Example 3.6. Consider the sector of general relativity in which solutions 
are asymptotically flat at spatial infinity in the sense of |3j. Here V = R 4 
and the only field is a Lorentzian metric g subject to the vacuum Einstein 
equation that is required to be twice-differentiable, to be globally hyperbolic, 
and to approximate Euclidean geometry in a suitable sense at spatial infinity. 

A helpful, but dispensable, way to think about these asymptotic bound- 
ary conditions appeals to an ideal boundary added to V that represents 
spatial infinity. In broadest outline, the picture is as follows. We attach a 
non-physical boundary % to V that has the topology and metric of "H the 
unit timelike hyperboloid in Minkowski spacetime; for convenience we fix an 
isometry d : H — » "Ho- Let g be a Lorentzian metric on V and let £ be a 
Cauchy surface relative to g. We say that £ has good asymptotic behaviour 
relative to g if the initial data that g induces on E represents space as asymp- 
totically Euclidean (and satisfies various additional technical conditions) and 
if the set of points <9£ C H, which we call the boundary of E, consisting of 
the limit points of E on H, has the same form as a set on % that arises by 
taking the intersection of "Ho with an spacelike hyperplane (in the standard 
model of Minkowksi spacetime). A metric g on V is in /C if by its lights it is 
possible to partition V by Cauchy surfaces with good asymptotic behaviour. 

The resulting theory is locally generally covariant — since locally its dy- 
namics is given by the Einstein equation. But is is not generally covariant: 
only certain diffeomorphisms from V to itself preserve the asymptotic bound- 
ary conditions. One can picture this as follows. Let 0o be the set consisting 
of all the intersections of T-Lq with spacelike hyperplanes in Minkowski space- 
time. And let := <i -1 O be the corresponding set of subsets of %. Any 
diffeomorphism / : V — » V induces a diffeomorphism /:%—>■ 7i. f is a 

5 Of course, this is not to say that say that there is nothing that is invariant across 
solutions: each solution involves a Lorentz-signature metric, each solution satisfies the 
field equation of the theory, etc. 
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symmetry of our theory if and only if under the obvious action / maps to 
itself. 

The most natural thing to say is that this theory lies between paradig- 
matic non-background-independent theories like those in in which fields prop- 
agate against the backdrop of Minkowski spacetime and paradigmatic background- 
independent theories like spatially compact general relativity. On the one 
hand, there are no fields on spacetime, fixed or dynamical, that encode a 
fixed background structure such as a geometry — indeed, locally the field of 
the theory has all of the freedom of the metric field of ordinary spatially 
compact general relativity. On the other, there is also a sense in which the 
boundary conditions of the theory ensure that any solution has the structure 
of Minkowski spacetime at infinity — and this is reflected in the fact that the 
theory is not generally covariant. Perhaps the point is put most vividly by 
saying that in this theory one has a class of preferred frames at infinity. An 
observation along these lines played an important role in motivating Einstein 
to investigate spatially compact cosmologies [25J . 

3.3 Geometrically Ambiguous Theories 

In each of the above examples, there was among the fields of the theory an 
obvious candidate to represent the geometry of spacetime — and the question 
of the extent to which the theory was background-independent turned on 
the question of the extent to which the behaviour of that field was solution- 
independent. But it is also possible to find theories featuring more than one 
field that could plausibly represent the geometry of spacetime. In such cases 
the question of background-independence becomes more subtle. 

Example 3.7. Consider a theory modelled on Nordstrom's scalar theory of 
gravityjj Let V ~ K 4 . Let there be no fixed field and let there be three 
dynamical fields — two metrics of Lorentz signature, rj and g, and a positive 
scalar field (ft, subject to the usual sort of smoothness conditions. In order to 
state the equations of motion, we also introduce a scalar field T that arises by 
taking the trace relative to g of the ordinary scalar field stress-energy tensor 
for phi relative to g. The equations of the theory fall into three groups. The 

6 For discussions of Nordstrom's theory, see [TU |36j 03] ■ The theory discussed here 
is not quite Nordstrom's, since in his theory the scalar field coupled to the trace of the 
stress-energy tensor of particulate matter. 



9 



first group, 



Riem[?7] = (la) 
a n (f) = -4ttG0 3 T, (lb) 

says that 77 is a flat metric and that is a modified Klein-Gordon field 
with respect to 77 with an interesting source term (here and below G denotes 
Newton's constant). The second group, 

Weylfc] = (2a) 
R[g] = 24ttGT, (2b) 

says that g is conformally flat and has scalar curvature proportional to T. 
The third group, 

g = 4> 2 r] (3a) 
= (-detg)* (3b) 
V = 0(-det#)-*, (3c) 

encodes relations between g on the one hand and and 77 on the other. 
These three sets of equations are highly redundant: in the presence of the 
third group, the first and second groups are equivalent to one another. 

Is this theory background-independent? This depends upon how we un- 
derstand the theoryQ 

On the one hand, one could take and 77 to be the fundamental physical 
fields of the theory and understand the theory as describing the propagation 
of a non-linear scalar field in Minkowski spacetime (g would then just be a 
clever but unphysical way of encoding and 77 in a single object obeying an 
elegant equation). Understood this way, the theory is as fully background- 
dependent as the theory of a Klein-Gordon field in Minkowski spacetime. 

Alternatively, we could think of g as the fundamental physical variable, 
understood as directly representing the geometry of spacetime. In this case 
we would view Equations (I2al) and (I2bl) as giving the laws of a non-linear 
field theory of spacetime geometry (Equations (|3b|) . (|3c|) . and (flbl) would 



7 For discussion of options for interpreting such theories, see 39, §§3. Ill and 3. IV]. 
Note that if one were to couple this theory to particulate matter in the obvious way (as 
Norstrom in fact did) then physical considerations would exert some pressure in favour of 
the second option discussed below [32] . 



10 



then tell us how to rewrite the theory as a theory of a scalar field propagating 
against a non-physical flat metric). In this case we would presumably want 
to consider the theory as being far from background-dependent: just as in 
general relativity, here we have a non-linear theory of a field with infinitely 
many degrees of freedom describing a spacetime geometry that is in general 
variably curved. 

3.4 Morals 

(1) Background-dependence and independence come in degrees: some theo- 
ries are fully background- (in) dependent, others only nearly so — and oth- 
ers fall somewhere in between. 

(2) A theory can fail to be fully background-independent in virtue of asymp- 
totic boundary conditions. 

(3) The extent of the background- (in) dependence of a theory is not a strictly 
formal one: in particular, it depends on how one thinks of the geometric 
structure of each solution and on what sorts differences between solutions 
one takes to be unphysical. 

Running through the discussion of the above examples was the idea that 
a theory is fully background-independent if each physical possibility cor- 
responds to a distinct spacetime geometry, and that it falls short of full 
background-independence to the extent that this condition fails. The task 
of the next several sections will be to put in place a framework of concepts 
that will allow us to make this idea precise. 

Remark 3.1 (Absolute Objects). General relativity was the first physical 
theory in which space and time did not have the objectionable feature of act- 
ing on matter without being acted upon by itfl The notion of an object that 
acts upon others without itself being acted upon is given a precise sense in 
the notion of an of an absolute object due to Anderson and Friedman: an ab- 
solute object of a theory is one that is locally the same up to diffeomorphism 
in every solution [D |2J El EH US] S 

8 This was a favourite theme of Einstein's; see |33j §3-9]. 

9 This notion has its problems — see [35] for a survey. Foremost among them is the fact 
that there are certain types of tensors, such as non- vanishing vector fields or symplectic 
forms, that are always locally identical up to diffeomorphism — so any field represented by 
such an object counts as absolute, no matter what its dynamics. 
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Since an absolute object of a theory is a field that is the same in each 
solution, it may seem natural to take a theory to be background-independent 
if and only if it features no absolute objects. But this suggestion founders 
on the morals reached above. For if one identifies background-independence 
with the lack of an absolute object then: (i) background-independence will 
be an all or nothing affair; (ii) if a theory fails to be background-independent, 
this is always in virtue of a the fact that some field of the theory is an absolute 
object; and (iii) the question of background-independence is rendered purely 
formal. 

4 Geometric Degrees of Freedom 

Suppose that one is handed a field theory (in the sense of Section [2] above). 
So far it is just a piece of mathematics. In order to endow it with physical 
content, one would have to say something about how the fields of the theory 
correlate with observable quantities. In particular, if the field theory is to 
be understood as an all-encompassing account of the classical world, then 
it must be endowed with geometric content: one must have some way of 
thinking of each solution of the theory as portraying physical processes in 
spacetime. 

Let us say that a geometrization for a field theory consists of: (i) a rule 
$ | — > 0$ that associates with each solution $ of the theory a geometric 
structure $3$ for the manifold V on which the fields of the theory live; and 
(ii) a criterion that tells us when geometries and $j<j> 2 are geometrically 
equivalent. 

When we think of a field theory as a relativistic field theory, we are 
implicitly thinking of it as endowed with a geometrization that: (i) assigns 
to each solution $ a Lorentzian metric 0$ on V that is among the fields of 
the theory or definable in terms of them; and (ii) counts such metrics as 
geometrically equivalent if and only they are related to one another by a 
diffeomorphism from V to itself. 

There are of course other notions of geometrization that arise. Pre- 
relativistic field theories can be thought of as involving a geometrization 
that: (i) assigns to each solution a spatial metric, a temporal metric, and 
an affine connection; and (ii) counts two such assignments as geometrically 
equivalent if and only they are related by a diffeomorphism. One might 
also tinker with the standard notion of a relativistic field theory by counting 
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metrics related by scale transformations as geometrically equivalent. 

In most of the examples discussed in the preceding section, a default 
reading of the theory as a relativistic field theory was available, since one 
and only one of the fields of the theory was a Lorentzian metric. In the 
final example discussed there were two fields of this type — and the question 
arose which one should be taken to describe the spatiotemporal geometry 
of solutions. Other situations are of course possible. A field theory may 
admit a natural reading as a relativistic field theory even though there are 
no Lorentzian metrics among; its fields^ Or a theory that includes one or 
more Lorentzian metrics among its fields may admit a natural geometrization 
by whose lights the geometry of a solution is given by some further metric 
(Nordstrom's theory could be put in this form, as could the Brans-Dicke 
theory) . 

Relative to a geometrization for a theory, we can look at the set of space- 
time geometries that arise in solutions of the theory and at the set of equiva- 
lence classes of such geometries under the relation of geometrical equivalence 
of the geometrization. We call any set of variables that parameterize this 
latter set the geometrical degrees of freedom of the theory relative to the 
geometrization. 

Relative to the standard notion of geometrization, the theories of Exam- 
ples 13.11 and 13.21 above have trivial geometrical degrees of freedom (up to 
equivalence, only one geometry occurs); the theories of Examples 13.31 and 13.41 
have finitely many geometrical degrees of freedom (the space of geometries 
modulo equivalence is finite-dimensional); and the theories of Examples 13.51 
and 13.61 have infinitely many geometrical degrees of freedom (the space of 
geometries modulo equivalence is the infinite-dimensional space of Ricci-flat 
Lorentzian metrics modulo diffeomorphisms) Example 13.7} Nordstrom's 
theory, is more subtle: relative to one way of reading the theory, it has 
trivial geometrical degrees of freedom; relative to another, it has infinitely 
many. 

10 Think here of the formulation of (2 + 1) gravity as a Chern-Simons theory |7[. or of 
certain 'parameterized' versions of familiar theories [2T> 1 12"5 1 ES]. 

11 Relative to the unorthodox approach that counts metrics related by scale transforma- 
tions as equivalent, the theory of Section [3T4l has trivial geometrical degrees of freedom. 
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5 Physical Degrees of Freedom 



This section focusses on the way of counting physical degrees of freedom 
that is standard among physicists (an alternative system of counting, more 
popular among philosophers than among physicists, is mentioned in Remark 
O below). 

In many cases, one takes the physical degrees of freedom of a theory to 
parameterize the space of solutionso But there are important exceptions. 

The equations of motion of a classical theory are said to be underdeter- 
mined if they are not independent of one another; for a precise character- 
ization, see [54"| p. 171]. The most prominent types of theories with un- 
derdetermined equations include generally covariant theories and theories of 
Yang-Mills type. But there are many other examples [24]. Uniqueness of so- 
lutions fails radically in theories with underdetermined equations of motion: 
the family of solutions corresponding to an admissible set of initial data is 
infinite-dimensional — roughly speaking, such a family can be parameterized 
by arbitrary functions of the independent variables of the theory. 

Since a theory is deterministic if and only if each instantaneous state is 
compatible with only one global history, there is usually a tight connection 
between a failure of uniqueness of solutions in a theory and the failure of that 
theory to be deterministic. But this connection obtains only if we assume that 
distinct solutions of our theory always represent physically distinct situations. 
Faced with the prospect of a wholesale and dramatic failure of determinism 
in the presence of underdetermined equations of motion, one usually prefers 
to reject this assumption. Standardly one assumes instead that a theory 
featuring underdetermined equations involves gauge freedom — one assumes, 
that is, that some of the degrees of freedom of the theory are unphysical and 
that, except perhaps in special cases, specifying instantaneous values of all 
variables suffices to determine the past and future behaviour of the physical 
degrees of freedom. 

The idea is to introduce introduce an equivalence relation — gauge equiv- 
alence — on the space of solutions, then to identify the physical degrees of 
freedom of the theory with variables that parameterize the quotient space 
that results when we identify gauge equivalent solutions. 

12 Actually this is non-standard — the space of solutions is isomorphic to the system's 
phase space, and the degrees of freedom are usually taken to parameterize the system's 
configuration space. But it is convenient here to omit the factor of one-half that should 
be inserted at various points below. 
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How can we characterize gauge equivalence? Roughly speaking, we want 
to consider solutions to be gauge equivalent if the underdetermination of the 
theory's equation of motion forces us to consider them physically equivalent, 
on pain of taking the theory to be indeterministic. But we cannot just take 
two solutions to be gauge equivalent if and only if they induce the same 
initial data at some instant of time. 

(i) From the fact that solutions $1 and $2 induce the same initial data at 
time £1 and the fact that solutions $2 and $3 induce the same initial 
data at time £2, it need not follow that there is a time at which $1 and 
$3 induce the same initial data. But we need gauge equivalence to be 
an equivalence relation, so we need to set things up so that from the 
fact that $! and $ 2 are gauge equivalent and the fact that $ 2 and $ 3 
are gauge equivalent, it follows that $1 and $3 are gauge equivalent. 

(ii) Simply identifying solutions whenever they induce the same initial data 
will efface genuine instances of indeterminism. The Newtonian n-body 
problem admits solutions in which a system of particles interacts in 
such a way that they all disperse to spatial infinity in finite time [T2] . 
Such a solution matches the trivial empty solution at late times. This 
constitutes real physical indeterminism, not an instance of gauge equiv- 
alence. Similarly, in the case of spatially compact general relativity, we 
will want to consider solutions gauge equivalent if and only if they are 
related by a diffeomorphism. Instances in which globally hyperbolic 
solutions admit non-isometric extensions are instances of genuine inde- 
terminism, not gauge equivalence; for examples see [8]. 

There are a number of ways of addressing the problem of characteriz- 
ing gauge equivalence. For theories in Lagrangian form, one can employ 
by-products of the variational procedure to construct a presymplectic form 
on the space of solutions of the theory and take two solutions to be gauge 
equivalent if and only if they can be connected by a (piecewise) null curve of 
this presymplectic form [TUl dU HH] . Alternatively, one can follow the Dirac 
constraint algorithm, which leads from a Lagrangian to a notion of gauge 
equivalence on the space of initial data of the Hamiltonian formulation of 
the theory [131 1211 121] ■ These two approaches are closely related to one 
another [28J. 

There is also a low-tech procedure that gives the same answers in stan- 
dard applications. Let us take for granted that we know, for any solution $, 
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which hypersurfaces in V correspond to initial data surfaces (=instants of 
time). Then we can make the following definitions. Let us say that a sym- 
metry 7 of a theory is a spoiler relative to solution $ if there is some initial 
data surface S C V relative to $ such that 7 acts as the identity on some 
neighbourhood U of £0 We call solutions $1 and $2 spoiler-related if there 
is a spoiler 7 relative to $1 such that $2 = 7" Being spoiler-related is not 
an equivalence relation (for the sort of reasons raised in point (i) above). We 
call an equivalence relation R an extension of the relation of being spoiler- 
related if any two spoiler-related solutions are also i?-related. There are 
many equivalence relations between solutions that extend the relation of be- 
ing spoiler-related. One such is the relation that takes any two solutions 
to be equivalent. That relation is maximally strong. We are after a much 
weaker one (i.e., one according to which fewer pairs of solutions are equiv- 
alent). Heuristically, what we want is the weakest equivalence relation that 
extends the spoiler- relatedness 1^1 For technical convenience, we take gauge 
equivalence to be the weakest equivalence relation on the space of solutions 
that extends spoiler-relatedness and which has equivalence classes that form 
submanifolds of the space of solutions — if there is any such relation. 

One can show (modulo certain technical questions) that for various sectors 
of general relativity this definition is well-formed and gives the same answer 
as the high-tech Lagrangian approach [6]. In particular for spatially compact 
general relativity, both approaches agree that solutions are gauge equivalent 
if and only if they are related by diffeomorphisms. And for well-behaved 
asymptotic boundary conditions at spacelike, timelike, or null infinity, both 
approaches agree that solutions are gauge equivalent if and only if they are 
related by diffeomorphisms that act as the identity at the appropriate asymp- 
totic boundary!^] 

To recap. Whichever way one handles the details, the idea is that we 
take solutions to be gauge equivalent if the formalism of the theory forces us 



13 I.e., for any x £ U, (7 ■ <&)(x) = $(2;). In vacuum general relativity, the spoilers are 
the diffeomorphisms that act as the identity on a neighbourhood of a Cauchy surface; in 
Maxwell's theory set in Minkowski spacetime (with a fixed metric) the spoilers are the 
transformations of the form A t— > A + dA where A is a scalar function that vanishes on a 
neighbourhood of a hyperplane of simultaneity. 

14 This the the relation R that takes solutions $ and $' to be equivalent if and only if 
there are solutions $1, . . . , such that $1 = $,$„ = and each is spoiler-related 
to (i = 1, . . 1). 

15 In some cases, this requires taking initial data surfaces to correspond to (partial) 
Cauchy surfaces with good asymptotic behaviour. 
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to consider them physically equivalent on pain of recognizing rampant and 
uninteresting indeterminism. We then identify the space of physically distinct 
possibilities described by the theory with the space of solutions modulo gauge 
equivalence, and identify the physical degrees of freedom of the theory with 
any set of variables that parameterizes that quotient space. 

Remark 5.1 (Non-Standard Counting). On the standard approach dis- 
cussed above, symmetries that relate gauge-equivalent solutions have a quite 
different status from other symmetries: indeed, the idea of gauge equivalence 
is often introduced by saying that whereas global symmetries like Lorentz 
symmetries relate physically distinct situations, the symmetries characteris- 
tic of Yang-Mills theories are non-physical and their presence an indication 
of redundancy in the variables of the theory. 

There is an alternative approach on which all symmetries are on equal 
footing: in order to find the degrees of freedom of a theory, one has to quotient 
out all symmetries — including those that are not gauge in the present sense. 
Thus, in the case of a theory whose equations of motion are not under- 
determined, one must quotient the space of solutions by the spatiotemporal 
and other symmetries of the theory in order to arrive at the true degrees of 
freedom. For this approach, see [Kit HTj. 

6 Background-Independence 

In paradigm background-dependent theories, the geometry is the same from 
solution solution to solution: vary the physical degrees of freedom as you 
will, you cannot alter the geometry of spacetime. What is the very opposite 
of such behaviour? A theory in which any change in the physical degrees 
of freedom makes a difference to the geometry of spacetime. This motivates 
the following definitions. 

Definition 6.1 (Full Background-Dependence). A field theory is fully background- 
dependent if it has no geometrical degrees of freedom: every solution is as- 
signed the same spacetime geometry as every other solution. 

Definition 6.2 (Full Background- Independence). A field theory is fully background- 
independent if all of its physical degrees of freedom correspond to geometrical 
degrees of freedom: two solutions correspond to the same physical geometry 
if and only if they are gauge equivalent. 
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Note that both of these definitions presuppose that a notion of geometriza- 
tion is in place; the second also requires a notion of gauge equivalence. 

In effect, these definitions measure the degree of background-independence 
by looking at how many of the physical degrees of freedom correspond to 
geometrical degrees of freedom. The limiting cases are full background- 
dependence (in which there are no geometrical degrees of freedom) and full 
background-independence (in which all physical degrees of freedom are geo- 
metrical degrees of freedom). Intermediate cases are also possible. 

Definition 6.3 (Near Background-Dependence). A field theory is nearly 
background- dependent if it has only finitely many geometrical degrees of free- 
dom: quotienting the space of geometries that arise in solutions of the theory 
by the relation of geometrical equivalence yields a finite-dimensional space. 

Definition 6.4 (Near Background- Independence). A field theory is nearly 
background-independent if it has a finite number of non-geometrical degrees 
of freedom: there is some N such that for any geometry arising in a solution 
of the theory, the space of gauge equivalence classes of solutions with that 
geometry is no more than iV-dimensional. 

These definitions give the desired verdicts concerning the examples of 
Section [3] above. 

The theories of Examples 13.11 and 13.21 describe a Klein-Gordon field in 
Minkowski spacetime (with or without the metric as a fixed field). Under 
the natural geometrization, these theories are fully background-dependent: 
each solution has the geometry of Minkowski spacetime, so there are no 
geometrical degrees of freedom] 16 ! 

The theory of Example 13.31 concerns a scalar field propagating against a 
flat but dynamical metric on a spacetime with toroidal spatial topology. 
Under the natural geometrization it is nearly background-dependent: each 
solution has the local geometry of Minkowski spacetime, but a finite number 
of global geometrical degrees of freedom remain. 

The theory of Example 13.41 describes a scalar field propagating against a 
de Sitter metric. Under the natural geometrization, the theory is nearly 

16 Here and below, the natural geometrization of a theory treats the theory as a rela- 
tivistic field theory in the sense of Section |4] above, with the geometry of each solution 
given by the metric tensor that appears as one of the fields of the theory. 
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background-dependent: the theory has a single geometrical degree of free- 
dom (the value of the curvature constant). Note, however, that the theory 
would count as fully background-dependent if one chose to count metric 
tensors related by scaling transformations as geometrically equivalent. 

The theory of Example 13.51 is spatially compact vacuum general relativity. 
Under the natural geometrization and notion of gauge equivalence, the 
theory is of course fully background-independent: two solutions share the 
same geometry if and only if they are related by a diffeomorphism if and 
only if they are gauge equivalent. 

The the theory of Example 13. 61 is vacuum general relativity with asymptotic 
flatness imposed at spatial infinity. Under the natural geometrization and 
the standard notion of gauge equivalence, the theory is nearly background- 
independent: two solutions correspond to the same geometry if and only 
they are related by a diffeomorphism; two solutions are gauge equivalent if 
and only if they are related by a diffeomorphism asymptotic to the identity 
at spatial infinity; if one quotients the family of solutions sharing a given ge- 
ometry by the relation of gauge equivalence, the result is a ten-dimensional 
spaceo Note, however, that if one were to adopt the non-standard method 
of counting physical degrees of freedom discussed in Remark l5.1l above. then 
this theory would count as fully background- independent. 

The theory of Example 13.71 is inspired by Nordstrom's bimetric theory of 
gravity: its fields include a metric tensor rj that is flat in every solution 
and a metric tensor g that varies from solution to solution. If the first 
of these fields is taken to give the spacetime geometry of the theory, then 
the theory is fully background-dependent (spacetime is R 4 , so there are 
no global degrees of freedom). If instead the metric g is taken to give 
the spacetime geometry, then the theory is not background-dependent, nor 
even nearly background-dependent: there are infinitely many geometrical 
degrees of freedom. 

17 The quotient of the group of diffcomorphisms by the group of diffcomorphisms acting 
as the identity at spatial infinity is the Poincare group. This group acts as a symmetry 
group on the space of gauge equivalence classes. See [3]. 
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7 Discussion 



The account of background- (in) dependence advanced above embodies the 
three morals developed in Section [3j (i) Background dependence and inde- 
pendence come in degrees, (ii) Background-independence can be spoiled by 
imposition of asymptotic boundary conditions, as well as by the presence of 
solution-invariant fields in the theory, (iii) The status of a theory vis-a-vis 
background-dependence and independence can depend on the interpretation 
given to the theory as well as on its formalism. 

In this final section, a number of loose ends and further topics are taken 
up: the possibility of translating the framework into a setting that allows 
more general sorts of field theories; further niceties concerning the depen- 
dence of background- (in) dependence on how theories are understood; fur- 
ther points about asymptotic boundary conditions; and how adding matter 
to general relativity affects the question of background-independence. 



7.1 More General Field Theories 

So far, a relatively narrow notion of a classical field theory has been em- 
ployed, according to which all fields are tensor fields. More generally, one 
might allow configurations of fields to be given by sections of arbitrary fi- 
bre bundles over spacetime. The notions of geometrization, spoilers, and 
gauge equivalence of Sections H] and [5] above carry over to this more general 
setting — and so the notions of background- (in) dependence given in Section 
|6] continue make sense. Whether these definitions still lead to reasonable 
results is a question for further investigation. Of course, not all fields have 
a well-defined transformation law under diffeomorphisms. So the notion of 
general covariance only makes sense for certain types of fields — those fields 
whose configurations are sections of a natural bundle over spacetime (see 

])• 



7.2 Dependence on Interpretation 

We have already seen that the notions of background-dependence, background- 
independence, etc., are not strictly formal notions: the status of given theory 
may depend on how the formalism of a theory is interpreted. This fact is 
reflected in the definitions given in SectionOabove: the degree of background- 
independence of a theory may depend on which of the fields of a theory are 
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taken to represent geometrical structure, on the standards used to determine 
whether two geometrical structures are 'the same,' and on whether one views 
solutions represented by (non-gauge) symmetries as always corresponding to 
the same physical situation!^! 

Here is another illustration. In setting up a model of an elastic continuum 
(a beam, a shell, a string . . . ) in a fixed spacetime background one can work 
with so-called material variables [30J: one begins with a space B whose 
points correspond to the material points of the continuum and identifies a 
history of the continuum with a map from B := B x R to a Lorentzian 
manifold (Vo,<7o) (with a diffeomorphism onto its image and the image of 
each {b} x R, b G B, a timelike curve). Such a theory can be considered 
a classical field theory in more or less the sense of Section [2] above, with 
B being the 'spacetime,' and the dynamical field of the theory being the 
Vo-valued field 0. There is an obvious way to geometrize such a theory by 
viewing each solution as endowing B with the structure of a relativistic 
spacetime — just pull g back to B via 00 If one now applies the definitions 
of Section [HI one typically finds that so-considered the theory is far from 
being background-dependent — in general distinct embeddings and 0' lead 
to non-isometric pull-backed metrics on B (unless boundary conditions have 
been imposed that fix the shape of the boundary of 4>{B)). But of course 
this is all based on a misreading of the theory. The physical spacetime of the 
theory is Vo, not B — and the theory counts as fully background-dependent 
when reformulated of as a classical field theory with Vq as its spacetime and 
with physical geometry given by the fixed field go. 

7.3 Asymptotic Boundary Conditions 

An important feature of the apparatus of Section O is that it allows one to 
isolate the sense in which the imposition of asymptotic boundary conditions 
can spoil background-independence. In the case of general relativity, the 
sorts of asymptotic boundary conditions usually discussed can be thought of 
in the following terms. One adjoins to the n-dimensional physical spacetime 
manifold V a non-physical (n — l)-dimensional boundary dV endowed with 

18 For illustrations, see the discussions in the preceding section of Nordstrom's theory, 
of the de Sitter-based theory, and of general relativity with asymptotic flatness imposed 
at spatial infinity. 

19 The Nambu-Goto action for string theory takes cf>*g to be the fundamental dynamical 
variable. One can do something similar in continuum mechanics [5]. 
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some sort of geometrical structure. Kinematically possible $ are required to 
exhibit good behaviour as they approach dV; likewise initial data surfaces 
should be asymptotic to subsets of dV that are geometrically nice. We denote 
by T> the group of spatiotemporal symmetries of our theory and by T>q the 
group of spatiotemporal symmetries asymptotic to the identity at dV. In 
typical examples V is a proper subgroup of V which is a proper subgroup of 
Diff(K). Further, one typically finds that T> is a normal subgroup of T> and 
that the quotient group G = V/V can be thought of as acting geometrically 
at dV (i.e., every diffeomorphism in T> induces an isomorphism from dV 
to itself, with diffeomorphisms that differ by an element of T>q inducing the 
same isomorphism). When this obtains, G is called the asymptotic symmetry 
group. 

For the most common sorts of asymptotic boundary conditions for vac- 
uum general relativity, one knows or expects that (relative to a suitable 
notion of initial data surface) solutions are gauge equivalent if and only if 
they are related by a diffeomorphism asymptotic to the identity at dV — and 
hence that (when all goes well) the asymptotic symmetry group acts on the 
space of gauge equivalence classes of solutions (for several examples and one 
counter-example, see [6]). 

If all of this obtains, then under its natural reading the theory is nearly 
background-independent so long the asymptotic symmetry group G is finite- 
dimensional — as it was in Example 13.61 above (where G was the Poincare 
group). But even when this picture holds, strange things can happen: when 
one imposes asymptotic flatness at null infinity, the asymptotic symmetry 
group is the infinite-dimensional Bondi-Metzner-Sachs group (see, e.g., [18]); 
and one expects that this group acts on the space of gauge equivalence 
classes of solutions [HI example 4]. This means that the theory comes out as 
not nearly background-independent: even after the geometry of spacetime is 
fixed, an infinite number of physical degrees of freedom remain free. This 
is an unsettling conclusion — but perhaps it can be accepted as being of a 
piece with the strangeness of the symmetry group of this sector of general 
relativity!^ 

One might also worry about a problem at the other end of the spectrum. 
What if one were to impose asymmetric asymptotic boundary conditions? 
E.g., what if one were to modify the standard framework for asymptotically 

20 It is sometimes suggested that the complicated asymptotic structure of this sector is 
a reflection of the fact that radiation can reach conformal infinity in this setting 0] . 
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anti-de Sitter spacetimes in such a way that the asymptotic symmetry group 
became trivial [5]? According to the present approach, the resulting theory 
would be fully background-independent, despite the presence of asymptotic 
boundary conditions! Presumably, however, this sort of theory is ruled out 
by our decision to focus on theories of the entire universe — it is difficult to 
see how the need for asymmetric asymptotic boundary conditions could arise 
for theories of this type. 

7.4 General Relativity with Matter 

So far, in considering general relativity, we have limited attention to the 
vacuum case. What happens when matter is introduced? That depends. 

Consider first what happens if one includes a scalar field <fi that sees but 
is not seen by the metric g (i.e., <fi obeys the Klein-Gordon equation relative 
to g but g obeys the vacuum Einstein equation). Under the obvious way of 
understanding this theory, it has the same geometrical degrees of freedom as 
vacuum general relativity — so it is certainly not anything like background- 
dependent according to the present approach. But corresponding to each 
way that the geometry of spacetime could be, there is a vast number of ways 
of arranging the physical degrees of freedom inherent in the scalar field — 
so the theory is not anything like background-independent according to the 
present approach. This is an intuitively attractive verdict: this theory is 
a sort of hybrid of vacuum general relativity (the paradigm of background- 
independence) and a theory in which the spacetime metric affects but is not 
affected by matter (which feature has often been taken to be a hallmark of 
background-dependence) . 

Of course, the theory of the preceding paragraph is a pathological ex- 
ample. At the other end of the spectrum lies general relativity coupled to 
dust. Given a solution to the Einstein-dust equations, one can reconstruct 
the dust trajectories and energy density from knowledge of the spacetime 
metric and the equations of motion (see, e.g., [38, §3.14]). So if two solutions 
agree in their geometry, they agree with respect to all physical degrees of 
freedom — the theory is fully background- independent. 

More typical are cases that lie somewhere in the vast middle ground. The 
Einstein-Maxwell theory is an interesting example. In effect it splits into a 
fully background-independent sector and a sector that exhibits the sort of 
behaviour seen in the hybrid theory above: in generic solutions, one can re- 
construct the electromagnetic field from knowledge of the geometry and the 
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equations of motion; but there exist exceptional solutions — those involving 
null electromagnetic fields — featuring geometries that are compatible with a 
huge range number of configurations of the electromagnetic field [17]. Here 
we have a theory that falls short of full background-independence in an il- 
luminating way (and which serves to illustrate the inadequacy of the naive 
equivalence between a theory's being background-independent and its having 
as its most natural formulation a generally covariant one). It would be desir- 
able to extend the framework of Section [6] to provide comparative measures 
of background-independence for theories like this one that fall short of near 
background-independence. 
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